# 128K long context
Instella 3B Long Instruct
Other
Instella-Long is an open-source language model with 3B parameters developed by AMD, supporting a context length of 128K and performing excellently in long-context benchmark tests.
Large Language Model
Transformers

I
amd
240
1
Gemma 3 1b It Qat Bnb 4bit
Gemma 3 is a lightweight open model series launched by Google, built on Gemini technology, supporting multimodal input and text output.
Image-to-Text
Transformers

G
unsloth
23
1
Phi 4 Mini Reasoning GGUF
MIT
Phi-4-mini-reasoning is a lightweight open model built on synthetic data, focusing on high-quality, reasoning-rich data, and further fine-tuned for more advanced mathematical reasoning capabilities.
Large Language Model
Transformers

P
Mungert
3,592
3
Openhands Lm 7b V0.1 GGUF
MIT
OpenHands LM is an open-source coding model built on Qwen Coder 2.5 Instruct 32B, which performs excellently in software engineering tasks through special fine-tuning.
Large Language Model English
O
Mungert
1,131
2
Nu2 Lupi Qwen 14B
Apache-2.0
Nu2-Lupi-Qwen-14B is a mathematical reasoning optimized model based on the Qwen 2.5 14B architecture, excelling in complex problem-solving and logical deduction.
Large Language Model
Transformers

N
prithivMLmods
23
2
Gemma 3 27b It Qat Unsloth Bnb 4bit
Gemma 3 is a lightweight, state-of-the-art multimodal open-source model launched by Google, capable of processing text and image inputs and generating text outputs.
Image-to-Text
Transformers

G
unsloth
2,591
1
Gemma 3 4b It Qat Unsloth Bnb 4bit
Gemma 3 is a lightweight, cutting-edge open model series launched by Google, built on Gemini model technology, supporting multimodal input and text output.
Image-to-Text
Transformers

G
unsloth
918
1
Gemma 3 4b It Qat GGUF
Gemma 3 is a lightweight, advanced open model series from Google, built on the same research and technology used to create Gemini models. This model is multimodal, capable of processing both text and image inputs to generate text outputs.
Text-to-Image English
G
unsloth
2,629
2
Gemma 3 27b It Qat
Gemma is a lightweight open model series launched by Google, built on Gemini model technology. Gemma 3 is a multimodal model supporting text and image inputs with text outputs, featuring a 128K large context window and multilingual capabilities.
Image-to-Text
Transformers

G
unsloth
168
2
Gemma 3 12b It Qat Unsloth Bnb 4bit
Gemma 3 is a lightweight and state-of-the-art open model family launched by Google, built on the same research and technology as the Gemini model. It supports multimodal input and text output.
Image-to-Text
Transformers

G
unsloth
1,422
1
Gemma 3 12b It Qat GGUF
Gemma is a lightweight, advanced open model series from Google, built using the technology behind the Gemini models. Gemma 3 is a multimodal model capable of processing both text and image inputs to generate text outputs.
Text-to-Image
G
unsloth
4,943
5
Gemma 3 12b It Qat
Gemma 3 is a lightweight, state-of-the-art multimodal open-source model launched by Google. It can process text and image inputs and generate text outputs, suitable for various text generation and image understanding tasks.
Image-to-Text
Transformers

G
unsloth
952
2
Synthia S1 27b Bnb 4bit
Synthia-S1-27b is an advanced reasoning AI model developed by Tesslate AI, focusing on logical reasoning, coding, and role-playing tasks.
Text-to-Image
Transformers

S
GusPuffy
858
1
Gemma 3 27b It Qat Q4 0 Unquantized
Gemma 3 is a lightweight and advanced multimodal open model launched by Google. It is built on the same research and technology as the Gemini model, supporting text and image inputs and generating text outputs.
Text-to-Image
Transformers

G
google
11.53k
23
Gemma 3 12b It Qat Int4 Unquantized
Gemma 3 is a lightweight multimodal open model from Google, supporting text and image inputs with text output, featuring a 128K large context window and multilingual capabilities.
Image-to-Text
Transformers

G
google
1,358
9
Gemma 3 4b It Qat Int4 Unquantized
Gemma 3 is a lightweight multimodal open model launched by Google, supporting text and image input and generating text output. The 4B version has undergone instruction tuning and quantization-aware training, making it suitable for deployment in resource-constrained environments.
Image-to-Text
Transformers

G
google
541
3
Gemma 3 1b It Qat Int4 Unquantized
Gemma is Google's lightweight advanced open model series, built with the same technology as Gemini, supporting multimodal input and text generation.
Large Language Model
Transformers

G
google
507
3
Gemma 3 27b It Qat Compressed Tensors
Gemma 3 is a lightweight and advanced open model series launched by Google, built on the same research and technology as the Gemini model. This version is an instruction-tuned model with 27B parameters, using quantization-aware training (QAT) and compressed tensor technology.
Image-to-Text
G
gaunernst
1,985
6
Gemma 3 12b It Qat Compressed Tensors
Gemma 3 is Google's lightweight cutting-edge open model family, built on the same research and technology used to create Gemini models. This model is multimodal, capable of processing both text and image inputs to generate text outputs.
Text-to-Image
G
gaunernst
867
1
Gemma 3 4b It Qat Q4 0 Unquantized
Gemma 3 is a lightweight open-source multimodal model introduced by Google, built on the same technology as Gemini, supporting text and image inputs to generate text outputs.
Image-to-Text
Transformers

G
google
1,159
5
Gemma 3 12b It Qat Q4 0 GGUF
Gemma is a lightweight, cutting-edge open model series from Google, built on Gemini technology. The 12B version is a multimodal model supporting text and image input, featuring a 128K large context window and support for over 140 languages.
Image-to-Text
G
Mungert
1,008
3
Gemma 3 4b It Qat Autoawq
Gemma 3 is a lightweight open-source multimodal model launched by Google, built on Gemini technology, supporting text and image input and generating text output.
Image-to-Text
Safetensors
G
gaunernst
503
1
Gemma 3 4b It Speech
Gemma-3-MM is a multimodal instruction model extended from Gemma-3-4b-it with added speech processing capabilities, capable of handling text, image, and audio inputs to generate text outputs.
Audio-to-Text
Transformers

G
junnei
383
12
Gemma 3 27b It Int4 Awq
Gemma is a lightweight and advanced open model series launched by Google, built on the same research and technology as Gemini. The 27B version is a multimodal model that supports text and image input and generates text output.
Text-to-Image
Transformers

G
gaunernst
17.62k
16
Gemma 3 27b Pt Qat Q4 0 Gguf
Gemma is a lightweight and cutting-edge open model family launched by Google, built on the same research and technology as the Gemini model. Gemma 3 is a multimodal model that can process text and image inputs and generate text outputs.
Image-to-Text
G
google
633
24
Gemma 3 27b It Qat Q4 0 Gguf
Gemma is a lightweight open-source multimodal model series launched by Google. It supports text and image inputs and generates text outputs. It has a 128K large context window and supports over 140 languages.
Image-to-Text
G
google
69.29k
251
Gemma 3 4b It Int4 Awq
Gemma is a lightweight, advanced open model series from Google, built using the same research technology as Gemini. Gemma 3 is a multimodal model capable of processing both text and image inputs to generate text outputs.
Text-to-Image
Transformers

G
gaunernst
1,054
1
Gemma 3 27b Pt Bnb 4bit
Gemma 3 is a lightweight open model series launched by Google, built on the same research and technology as the Gemini model, supporting multimodal input and text output.
Image-to-Text
Transformers English

G
unsloth
2,009
1
Gemma 3 1b Pt Unsloth Bnb 4bit
Gemma 3 is a series of lightweight open models launched by Google, supporting multimodal input (text and images), with a 128K large context window, suitable for various tasks such as question answering and summarization.
Image-to-Text
Transformers English

G
unsloth
4,481
3
Gemma 3 4b It Qat Q4 0 Gguf
Gemma 3 is Google's lightweight cutting-edge open-source multimodal model supporting text and image inputs with text output, featuring 128K context window and 140+ language support
Image-to-Text
G
google
19.81k
120
Gemma 3 1b It Qat Q4 0 Gguf
Gemma is Google's lightweight cutting-edge open model series, built using the same research technology as Gemini. The 1B version is instruction-tuned, suitable for deployment in resource-constrained environments.
Text-to-Image
G
google
4,862
36
Gemma 3 1b It
Gemma 3 is a lightweight advanced open model series launched by Google, built on the same research and technology as the Gemini models. This model is multimodal, capable of processing both text and image inputs to generate text outputs.
Text-to-Image
Transformers

G
google
2.1M
347
Gemma 3 12b Pt
Gemma is a lightweight open-source multimodal model series launched by Google, built on the same technology as Gemini, supporting text and image inputs and generating text outputs.
Image-to-Text
Transformers

G
google
54.36k
46
Phi 4 Multimodal Instruct
MIT
Phi-4-multimodal-instruct is a lightweight open-source multimodal foundation model that integrates language, vision, and speech research and datasets from Phi-3.5 and 4.0 models. It supports text, image, and audio inputs to generate text outputs, with a context length of 128K tokens.
Multimodal Fusion
Transformers Supports Multiple Languages

P
Robeeeeeeeeeee
21
1
C4ai Command R7b Arabic 02 2025
A 7B-parameter large language model optimized for Arabic, supporting 128K context length with excellent performance in enterprise-level tasks
Large Language Model
Transformers Supports Multiple Languages

C
CohereLabs
2,335
101
Phi 4 Multimodal Instruct Onnx
MIT
ONNX version of the Phi-4 multimodal model, quantized to int4 precision with accelerated inference via ONNX Runtime, supporting text, image, and audio inputs.
Multimodal Fusion Other
P
microsoft
159
66
Spec Vision V1
MIT
Spec-Vision-V1 is a lightweight, state-of-the-art open-source multimodal model designed for deep integration of visual and textual data, supporting a 128K context length.
Text-to-Image
Transformers Other

S
SVECTOR-CORPORATION
17
1
Chocolatine 2 14B Instruct V2.0.3
Apache-2.0
Chocolatine-2-14B-Instruct-v2.0.3 is a large language model based on the Qwen-2.5-14B architecture, fine-tuned with DPO, specializing in French and English tasks, and excels in the French LLM leaderboard.
Large Language Model
Transformers Supports Multiple Languages

C
jpacifico
329
14
C4ai Command R7b 12 2024
Command R7B is an open-weight 7B parameter research version model, optimized for diverse scenarios such as reasoning, summarization, Q&A, and coding, supporting 23 languages.
Large Language Model
Transformers Supports Multiple Languages

C
CohereLabs
6,812
381
Phi 3.5
MIT
Phi-3.5 is an advanced large language model developed by Microsoft based on the Phi-3 architecture, focusing on high-quality, reasoning-rich data and supporting a context length of 128K tokens.
Large Language Model
P
cortexso
304
1
- 1
- 2
Featured Recommended AI Models